An automated linguistic knowledge-based cross-language transfer method for building acoustic models for a language without native training data
نویسندگان
چکیده
In this paper we describe an automated, linguistic knowledgebased method for building acoustic models for a target language for which there is no native training data. The method assumes availability of well-trained acoustic models for a number of existing source languages. It employs statistically derived phonetic and phonological distance metrics, particularly a combined phonetic-phonological (CPP) metric, defined to characterize a variety of linguistic relationships between phonemes from the source languages and a target language. Using these metrics, candidate phonemes from the source languages are automatically selected for each phoneme of the target language and acoustic models are constructed. Our experiments show that this automated method can generate acoustic models with good quality, far above the general phoneme symbol-based crosslanguage transfer strategy, reaching the performance of models generated through acoustic-distance mapping.
منابع مشابه
Lexicalization vs. Vocalization: A Cross-Linguistic Study of Emphasis in English and Persian
Language is a system of verbal elements that makes communication of meaningspossible in the manners the users intend by employing certain linguistic deviceswhich are partly language-specific. Once communicating cross-linguistically, thereis always a risk of negative transfer of techniques or processes from the firstlanguage (L1) to the foreign language (L2). The current study investigates the“e...
متن کاملCross-Linguistic Transfer or Target Language Proficiency: Writing Performance of Trilinguals vs. Bilinguals in Relation to the Interdependence Hypothesis
This study explored the nature of transfer among bilingual vs. trilinguals with varying levels of competence in English and their previous languages. The hypotheses were tested in writing tasks designed for 75 high (N= 35) vs. intermediate (N=40) proficient EFL learners with Turkish, Persian, English and Persian, English linguistic backgrounds. Qualitative data were also collected through some ...
متن کاملInvestigating Iranian Language Learners’ Use of Circumlocution for Culture-Specific Referents
This study investigated Iranian English language learners’ use of circumlocution for culture-specific referents. A discourse completion test (DCT) was designed in English and Persian, consisting of items dealing with Iranian culture-specific notions and distributed among 3 groups. The Persian language group received the Persian version, whereas the English language learners, divided into high a...
متن کاملRapid Building of an ASR System for Under-Resourced Languages Based on Multilingual Unsupervised Training
This paper presents our work on rapid language adaptation of acoustic models based on multilingual cross-language bootstrapping and unsupervised training. We used Automatic Speech Recognition (ASR) systems in the six source languages English, French, German, Spanish, Bulgarian and Polish to build from scratch an ASR system for Vietnamese, an underresourced language. System building was performe...
متن کاملGenre analysis of literature research article abstracts: A cross-linguistic, cross-cultural study
Following Swales’s (1981) works on genre analysis, studies on different sections of Research Articles (RAs) in various languages and fields abound; however, only scant attention has been directed toward abstracts written in Persian, and in the field of literature. Moreover, claims made by Lores (2004) regarding the correspondence of two types of abstracts with different ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005